On non-asymptotic bounds for estimation in generalized linear models with highly correlated design

Authors

  • Sara A. van de Geer
Abstract

We study an increment bound for the empirical process indexed by linear combinations of highly correlated base functions. We use direct arguments instead of the chaining technique. We moreover obtain bounds for an M-estimation problem by using a convexity argument instead of the peeling device. Combining the two results leads to non-asymptotic bounds with explicit constants.

Let us motivate our approach. In M-estimation, often also called empirical risk minimization, some empirical average indexed by a parameter is minimized. To study the theoretical properties of the estimator obtained in this way, the theory of empirical processes has been a successful tool. Indeed, empirical process theory studies the convergence of averages to expectations, uniformly over some parameter set. Among the techniques involved are the chaining technique (see e.g. [13]), which relates increments of the empirical process to the entropy of the parameter space, and the peeling device (a term from [10] that goes back to [1]), which allows one to handle weighted empirical processes. Concentration inequalities (see e.g. [9]), which describe the concentration of the supremum of the empirical process around its mean, are also extremely useful in M-estimation problems.

A more recent trend is to derive non-asymptotic bounds for M-estimators. The papers [6] and [4] provide concentration inequalities with economical constants, which leads to good non-asymptotic bounds in certain cases [7]. In general, however, both the chaining technique and the peeling device may lead to large constants in the bounds; for an example, see the remark following (5). Our aim in this paper is simply to avoid the chaining technique and the peeling device. Our results should primarily be seen as a non-trivial illustration that both techniques may be dispensable, leaving possible improvements for future research. In particular, we will at this stage not try to optimize the constants, i.e. we will make some arbitrary choices. Moreover, as we shall see, our bound for the increment ...
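
To make the M-estimation setup described above concrete, here is a minimal sketch of empirical risk minimization for a simple generalized linear model: the empirical average of a logistic loss, indexed by the parameter beta, is minimized over the parameter space. The simulated design X, the response y, the logistic loss, and the BFGS call are illustrative assumptions and do not reproduce the highly correlated design studied in the paper.

```python
# Minimal sketch of M-estimation / empirical risk minimization (illustrative
# assumptions only: logistic loss, simulated data, BFGS; not the paper's
# correlated-design setting).
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)
n, p = 200, 5
X = rng.normal(size=(n, p))                      # design matrix
beta_true = np.array([1.0, -0.5, 0.0, 0.25, 0.0])
y = rng.binomial(1, 1.0 / (1.0 + np.exp(-X @ beta_true)))

def empirical_risk(beta):
    """R_n(beta) = (1/n) * sum_i log(1 + exp(-(2 y_i - 1) x_i' beta))."""
    margins = (2 * y - 1) * (X @ beta)
    return np.mean(np.logaddexp(0.0, -margins))

# The M-estimator minimizes the empirical average over the parameter set.
beta_hat = minimize(empirical_risk, x0=np.zeros(p), method="BFGS").x
print("estimate:", np.round(beta_hat, 2))
```

Empirical process theory then controls how far such an empirical average can deviate from its expectation uniformly over beta, which is what the increment bounds discussed in the abstract quantify.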

Similar articles

Parameter Estimation in Spatial Generalized Linear Mixed Models with Skew Gaussian Random Effects using Laplace Approximation

Spatial generalized linear mixed models are commonly used for modelling non-Gaussian discrete spatial responses. We present an algorithm for parameter estimation in these models using a Laplace approximation of the likelihood function. In these models, the spatial correlation structure of the data is captured by random effects or latent variables. In most spatial analyses, it is assumed that rando...
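
As a hedged illustration of the Laplace approximation step mentioned in this abstract, the one-dimensional sketch below approximates the integral over a single random effect u of exp(h(u)) by exp(h(u_hat)) * sqrt(2*pi / (-h''(u_hat))), where u_hat maximizes h. The Poisson-plus-Gaussian integrand is an assumption for illustration, not the skew-Gaussian spatial model of the paper.

```python
# One-dimensional Laplace approximation of  integral exp(h(u)) du  (illustrative
# integrand: Poisson log-likelihood plus a Gaussian random-effect log-density,
# constants dropped; not the paper's skew-Gaussian spatial model).
import numpy as np
from scipy.optimize import minimize_scalar
from scipy.integrate import quad

def h(u, y=3, sigma=1.0):
    # unnormalized log-integrand: y*u - exp(u) (Poisson part) - u^2/(2*sigma^2) (random effect)
    return y * u - np.exp(u) - 0.5 * (u / sigma) ** 2

u_hat = minimize_scalar(lambda u: -h(u)).x       # mode of h
eps = 1e-4
h2 = (h(u_hat + eps) - 2 * h(u_hat) + h(u_hat - eps)) / eps ** 2   # h''(u_hat) < 0
laplace = np.exp(h(u_hat)) * np.sqrt(2 * np.pi / -h2)
exact, _ = quad(lambda u: np.exp(h(u)), -10.0, 10.0)
print(laplace, exact)                            # the two values should be close
```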

Bayesian Inference for Spatial Beta Generalized Linear Mixed Models

In some applications, the response variable takes values in the unit interval. The standard linear regression model is not appropriate for modelling this type of data because the normality assumption is not met. Alternatively, the beta regression model has been introduced to analyze such observations. The beta distribution is a flexible density family on the (0, 1) interval that covers symm...

Flexible results for quadratic forms with applications to variance components estimation

We derive convenient uniform concentration bounds and finite sample multivariate normal approximation results for quadratic forms, then describe some applications involving variance components estimation in linear random-effects models. Random-effects models and variance components estimation are classical topics in statistics, with a corresponding well-established asymptotic theory. ...
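
To illustrate the connection between quadratic forms and variance components estimation, here is a hedged sketch of the classical ANOVA estimators in a balanced one-way random-effects model; both sums of squares are quadratic forms in the response vector. The dimensions and simulated data are assumptions for illustration and are not taken from the paper.

```python
# Hedged sketch: variance components via quadratic forms in a balanced one-way
# random-effects model y_ij = mu + a_i + e_ij (illustrative data, not the paper's).
import numpy as np

rng = np.random.default_rng(2)
k, n = 30, 10                      # k groups, n observations per group
sigma_a, sigma_e = 1.5, 1.0
y = sigma_a * rng.normal(size=(k, 1)) + sigma_e * rng.normal(size=(k, n))

group_means = y.mean(axis=1)
grand_mean = y.mean()

# Both sums of squares are quadratic forms y'Ay for suitable matrices A.
ss_between = n * np.sum((group_means - grand_mean) ** 2)
ss_within = np.sum((y - group_means[:, None]) ** 2)

ms_between = ss_between / (k - 1)          # estimates sigma_e^2 + n * sigma_a^2
ms_within = ss_within / (k * (n - 1))      # estimates sigma_e^2
sigma_a2_hat = (ms_between - ms_within) / n
print(sigma_a2_hat, ms_within)             # should be roughly near 2.25 and 1.0
```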

Estimation with Norm Regularization

Analysis of non-asymptotic estimation error and structured statistical recovery based on norm regularized regression, such as Lasso, needs to consider four aspects: the norm, the loss function, the design matrix, and the noise model. This paper presents generalizations of such estimation error analysis on all four aspects compared to the existing literature. We characterize the restricted error...
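
As a small, hedged illustration of the norm-regularized estimators discussed in this abstract, the sketch below fits the l1-regularized special case (the Lasso) to simulated sparse data; the data, the regularization level, and the use of scikit-learn are assumptions for illustration, not the general framework of the paper.

```python
# Hedged sketch of norm-regularized estimation with the l1 norm (Lasso) as the
# concrete special case; all data and settings here are illustrative assumptions.
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(1)
n, p = 100, 20
X = rng.normal(size=(n, p))
beta = np.zeros(p)
beta[:3] = [2.0, -1.5, 1.0]                    # sparse ground truth
y = X @ beta + 0.5 * rng.normal(size=n)

# scikit-learn's Lasso minimizes (1/(2n)) * ||y - X b||_2^2 + alpha * ||b||_1.
model = Lasso(alpha=0.1).fit(X, y)
print("nonzero coefficients:", np.flatnonzero(model.coef_))
```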

Generalized Dantzig Selector: Application to the k-support norm

We propose a Generalized Dantzig Selector (GDS) for linear models, in which any norm encoding the parameter structure can be leveraged for estimation. We investigate both computational and statistical aspects of the GDS. Based on the conjugate proximal operator, a flexible inexact ADMM framework is designed for solving the GDS. Thereafter, non-asymptotic high-probability bounds are established on the e...

Publication date: 2008